Instance-Based Skew Estimation of Document Images by a Combination of Variant and Invariant

نویسندگان

  • Seiichi Uchida
  • Megumi Sakai
  • Masakazu Iwamura
  • Shinichiro Omachi
  • Koichi Kise
چکیده

A novel technique for estimating geometric deformations is proposed and applied to document skew (i.e., rotation) estimation. The proposed method possesses two novel properties. First, the proposed method estimates the skew angles at individual connected components. Those skew angles are then voted to determine the skew angle of the entire document. Second, the proposed method is based on instancebased learning. Specifically, a rotation variant and a rotation invariant are learned, i.e., stored as instances for each character category, and referred for estimating the skew angle very efficiently. The result of a skew estimation experiment on 55 document images has shown that the skew angles of 54 document images were successfully estimated with errors smaller than 2.0 degree. The extension for estimating perspective deformation is also discussed for the application to camera-based OCR.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Instance-Based Skew Detection of Document Images by a Combination of Variant and Invariant

A novel deformation estimation technique is proposed and applied to document skew estimation. The proposed method has two properties. First, it utilizes an invariant and a variant of a target deformation to be estimated. Second, it is an instance-based method where the deformation is estimated by referring stored instances which describe the relation among the deformation, the variant, and the ...

متن کامل

Skew Estimation by Parts

This paper proposes a new part-based approach for skew estimation of document images. The proposed method first estimates skew angles on rather small areas, which are the local parts of characters, and subsequently determines the global skew angle by aggregating those local estimations. A local skew estimation on a part of a skewed character is performed by finding an identical part from prepar...

متن کامل

Document Analysis And Classification Based On Passing Window

In this paper we present Document analysis and classification system to segment and classify contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...

متن کامل

A Fast and Novel Skew Estimation Approach using Radon Transform

In this paper, an effective and reliable skew estimation technique for machine printed documents and photos using radon transform is proposed and is compared with other methods used for skew estimation such as Fast Fourier Transform (FFT), Hough Transform (HT), combination of Horizontal Projection Profile (HPP) and Hough Transform, combination of Gabor filter and Radon transform, combination of...

متن کامل

Pseudo Zernike Moment-based Multi-frame Super Resolution

The goal of multi-frame Super Resolution (SR) is to fuse multiple Low Resolution (LR) images to produce one High Resolution (HR) image. The major challenge of classic SR approaches is accurate motion estimation between the frames. To handle this challenge, fuzzy motion estimation method has been proposed that replaces value of each pixel using the weighted averaging all its neighboring pixels i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007